A Large-Scale Characterization of How Readers Browse Wikipedia
نویسندگان
چکیده
Despite the importance and pervasiveness of Wikipedia as one largest platforms for open knowledge, surprisingly little is known about how people navigate its content when seeking information. To bridge this gap, we present first systematic large-scale analysis readers browse Wikipedia. Using billions page requests from Wikipedia’s server logs, measure reach articles, they transition between these patterns combine into more complex navigation paths. We find that behavior characterized by highly diverse structures. Although most paths are shallow, comprising a single pageload, there much variety, depth shape vary systematically with topic, device type, time day. show commonly mesh external pages part larger online ecosystem, describe naturally occurring distinct targeted in lab-based settings. Our results further suggest abandoned low-quality pages. Taken together, insights contribute to understanding readers’ information needs allow improving their experience on Web general.
منابع مشابه
Deriving a Large-Scale Taxonomy from Wikipedia
We take the category system inWikipedia as a conceptual network. We label the semantic relations between categories using methods based on connectivity in the network and lexicosyntactic matching. As a result we are able to derive a large scale taxonomy containing a large amount of subsumption, i.e. isa, relations. We evaluate the quality of the created resource by comparing it with ResearchCyc...
متن کاملSearch strategies of Wikipedia readers
The quest for information is one of the most common activity of human beings. Despite the the impressive progress of search engines, not to miss the needed piece of information could be still very tough, as well as to acquire specific competences and knowledge by shaping and following the proper learning paths. Indeed, the need to find sensible paths in information networks is one of the bigges...
متن کاملConstructing Large-Scale Person Ontology from Wikipedia
This paper presents a method for constructing a large-scale Person Ontology with category hierarchy from Wikipedia. We first extract Wikipedia category labels which represent person (hereafter, Wikipedia Person Category, WPC) by using a machine learning classifier. We then construct a WPC hierarchy by detecting is-a relations in the Wikipedia category network. We then extract the titles of Wiki...
متن کاملVisualizing large-scale human collaboration in Wikipedia
Volunteer-driven large-scale human-to-human collaboration has become common in the Web 2.0 era. Wikipedia is one of the foremost examples of such large-scale collaboration, involving millions of authors writing millions of articles on a wide range of subjects. The collaboration on some popular articles numbers hundreds or even thousands of co-authors. We have analysed the co-authoring across en...
متن کاملWikiReading: A Novel Large-scale Language Understanding Task over Wikipedia
We present WIKIREADING, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the corresponding Wikipedia articles. The task contains a rich variety of challenging classification and extraction sub-tasks, making it well-suited for end-to-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on The Web
سال: 2023
ISSN: ['1559-1131', '1559-114X']
DOI: https://doi.org/10.1145/3580318